Mosaicing-by-recognition for recognizing texts captured in multiple video frames

نویسندگان

Seiichi Uchida

Hiromitsu Miyazaki

Hiroaki Sakoe

چکیده

Text recognition in video frames is promising because of its following superiorities over text recognition in a still camera image: (1) it is possible to recognize longer texts by concatenating the frames, and (2) it is also possible to improve the quality of the text image by integrating the frames. In this paper, a mosaicing-by-recognition technique is proposed where video mosaicing and text recognition are simultaneously and collaboratively performed in a one-step manner by a dynamic programming-based optimization algorithm. In this optimization algorithm, rotation, scaling, vertical shift, and speed fluctuation of camera motion are efficiently compensated. The results of experiments to evaluate not only the accuracy of text recognition but also that of video mosaicing indicates that the proposed technique is practical and can provide reasonable results in most cases.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Hand Gesture Recognition from RGB-D Data using 2D and 3D Convolutional Neural Networks: a comparative study

Despite considerable enhances in recognizing hand gestures from still images, there are still many challenges in the classification of hand gestures in videos. The latter comes with more challenges, including higher computational complexity and arduous task of representing temporal features. Hand movement dynamics, represented by temporal features, have to be extracted by analyzing the total fr...

متن کامل

Detection and Recognition of Multi-language Traffic Sign Context by Intelligent Driver Assistance Systems

Design of a new intelligent driver assistance system based on traffic sign detection with Persian context is concerned in this paper. The primary aim of this system is to increase the precision of drivers in choosing their path with regard to traffic signs. To achieve this goal, a new framework that implements fuzzy logic was used to detect traffic signs in videos captured along a highway f...

متن کامل

Robust Video Mosaicing through Topology Inference and Local to Global Alignment

The problem of piecing together individual frames in a video sequence to create seamless panoramas (video mosaics) has attracted increasing attention in recent times. One challenge in this domain has been to rapidly and automatically create high quality seamless mosaics using inexpensive cameras and relatively free hand motions. In order to capture a wide angle scene using a video sequence of r...

متن کامل

Image Mosaicing using first order Moments and Pixel to Pixel Comparison

In real-life situations, the field of view of the imaging devices is usually smaller than the scene to be imaged, due to the inherent limitations of the imaging devices. Even if the whole scene is captured in a single exposure, it could result in an image having poor resolution. In such circumstances, it will never be possible to capture the scene as a single image for further processing or hum...

متن کامل

Video Mosaicing using Manifold Projection

Video mosaicing is commonly used to increase the visual eld of view by pasting together many video frames. Existing mosaicing methods are eeective only in very limited cases where the image motion is almost a uniform translation or the camera performs a pure pan. Forward camera motion or camera zoom are very problematic for traditional mosaicing. A mosaicing methodology to allow image mosaicing...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2005

Mosaicing-by-recognition for recognizing texts captured in multiple video frames

نویسندگان

چکیده

منابع مشابه

Hand Gesture Recognition from RGB-D Data using 2D and 3D Convolutional Neural Networks: a comparative study

Detection and Recognition of Multi-language Traffic Sign Context by Intelligent Driver Assistance Systems

Robust Video Mosaicing through Topology Inference and Local to Global Alignment

Image Mosaicing using first order Moments and Pixel to Pixel Comparison

Video Mosaicing using Manifold Projection

عنوان ژورنال:

اشتراک گذاری